Transcript Sequence Registry

نویسندگان

  • Junping Jing
  • Jian Jiang
  • Jeffrey S. Aaronson
چکیده

In many computational genomic studies, including large-scale expression profiling, it is useful to have a comprehensive collection of transcript sequences. The Reference Sequence [NCBI, 1999] project at NCBI is an effort to identify a non-redundant set of full-length mRNA sequences contained within GenBank [Benson et al., 1999] entries. Currently, RefSeq consists of approximately 6,300 human mRNA sequences and approximately 2,000 mouse mRNA sequences. Since each RefSeq entry requires human review in order to meet desired quality standards, the rate of introduction of new RefSeq sequences is limited. At the same time, high-throughput genomic sequencing efforts, along with highthroughput annotation, are increasing the rate of introduction of (putative) new genes, full-length and partial, into GenBank. As such, it is unlikely that RefSeq will provide a comprehensive collection of transcript sequence from GenBank for some time. In order to provide a comprehensive repository of transcript sequences to support our internal projects, we have developed a system, the Transcript Sequence Registry (TSR), which automatically excises transcript sequences from both RefSeq and GenBank, and then stores the sequences, along with associated information, in a relational database. The Transcript Sequence Registry is intended to be the foundation of a gene-oriented resource; though unlike UniGene [Wheeler et al., 2000], expressed sequence tags (EST) are excluded.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)

Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally.Objective: In the present study, we report the results of a systemic search for identifi cation of new miRNAs in B. rapa using homology-based ...

متن کامل

Molecular cloning and characterization of a murine AIDS virus-related endogenous transcript expressed in C57BL/6 mice.

The murine AIDS (MAIDS) virus has a unique sequence in the gag p12 region, which could be responsible for MAIDS development. RNA preparations from the spleens of normal uninfected C57BL/6 mice contain a transcript hybridizing with this sequence. Levels of the transcript in the kidney of C57BL/6 mice were higher than in the spleen, liver or thymus. Although BALB/c, NFS, DBA/2 and SL murine strai...

متن کامل

DNA sequence analysis of crown gall tumor T-DNA encoding the 0.7 kb transcript.

Crown gall tumor formation involves integration into the plant genome of DNA sequences (the T-region) of tumor-inducing (Ti) plasmids present in Agrobacterium tumefaciens. The T-DNA of the tumor expresses several gene products. Little is known about the function or regulation of expression of the 0.7kb transcript, which represents a relatively abundant T-DNA transcript in octopine-type tumors. ...

متن کامل

Cloning and sequencing of rainbow trout (Oncorhynchus mykiss) interferon regulatory factor 7

  Interferon regulatory factor 7 (IRF7) gene was cloned from a subtractive cDNA library constructed with mRNAs obtained from rainbow trout (Oncorhynchus mykiss) macrophage cell line (RTS-11). Using expressed sequence tag clones of submitted IRF7 amino acid sequences, specific primers were designed. Results showed that IRF7 cDNA contains an ORF of 1251 nucleotides that translates into a 416 resi...

متن کامل

Genome-Wide Transcript Profiling Reveals the Coevolution of Plastid Gene Sequences and Transcript Processing Pathways in the Fucoxanthin Dinoflagellate Karlodinium veneficum

Plastids utilize a complex gene expression machinery, which has coevolved with the underlying genome sequence. Relatively, little is known about the genome-wide evolution of transcript processing in algal plastids that have undergone complex endosymbiotic events. We present the first genome-wide study of transcript processing in a plastid acquired through serial endosymbiosis, in the fucoxanthi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000